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OO ' Abstract 

o 

' The fc-LEAF Out-Branching problem is to find an out-branching (i.e. a rooted oriented 

■ spanning tree) with at least k leaves in a given digraph. The problem has recently received much 

^ ' attention from the viewpoint of parameterized algorithms [H El 13 I21j . In this paper we step 

Q . aside and take a kernelization based approach to the fc-LEAF-OUT-BRANCHiNG problem. We 

' give the first polynomial kernel for Rooted fc-LEAF-OUT-BRANCHiNG, a variant of A:-Leaf- 

Out-Branching where the root of the tree searched for is also a part of the input. Our kernel 
has cubic size and is obtained using extremal combinatorics. 

For the A:-Leaf-Out-Branching problem we show that no polynomial kernel is possible 
' unless polynomial hierarchy collapses to third level by applying a recent breakthrough result 

. by Bodlaender et al. [3] in a non-trivial fashion. However our positive results for Rooted 

' fc-LEAF-OuT-BRANCHiNG immediately imply that the seemingly intractable the A:-Leaf-Out- 

O . Branching problem admits a data reduction to n independent 0{k^) kernels. These two 

results, tractability and intractability side by side, are the first separating many-to-one kernel- 
, ization from Turing kernelization. This answers affirmatively an open problem regarding "cheat 

^ ' kernelization" raised in 
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1 Introduction 



00 ! Kernelization is a powerful and natural technique in the design of parameterized algorithms. The 
■ main idea of kernelization is to replace a given parameterized instance (/, k) of a problem n by 
a simpler instance {I',k') of IT in polynomial time, such that (/, fc) is a yes instance if and only 
^ I if {I',k') is a yes instance and the size of /' is bounded by a function of k alone. The reduced 
' instance /' is called the kernel for the problem. Typically kernelization algorithms work by applying 
reduction rules, which iteratively reduce the instance to an equivalent "smaller" instance. From this 
point of view, kernelization can be seen as pre-processing with an explicit performance guarantee, 
"a humble strategy for coping with hard problems, almost universally employed" |13j . 

A parameterized problem is said to have a polynomial kernel if we have a kernelization algorithm 
such that the size of the reduced instance obtained as its output is bounded by a polynomial of 
the parameter of the input. There are many parameterized problems for which polynomial, and 
even linear kernels are known [9l [HI \T2\ [T8\ I25j . Notable examples include a 2A;-sized kernel for 
/c- Vertex Cover [9], a 0{k'^) kernel for /c-Feedback Vertex Set [25] and a 67k kernel for 
/c-Planar-Dominating Set |8], among many others. While positive kernelization results have 
been around for quite a while, the first results ruling out polynomial kernels for parameterized 
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problems have appeared only recently. In a seminal paper Bodlaender et al. [3] have shown that a 
variety of important FPT problems cannot have polynomial kernels unless the polynomial hierarchy 
collapses to third level {PH = S^), a well known complexity theory hypothesis. Examples of such 
problems are /c-Path, A;-Minor Order Test, A;-Planar Graph Subgraph Test, and many 
others. However, while this negative result rules out the existence of a polynomial kernel for these 
problems it does not rule out the possibility of a kernelization algorithm reducing the instance to 
independent polynomial kernels. This raises the question of the relationship between many- 
to-one kernelization and Turing kernelization, a question raised in [3lll2^[T8]. That is, can we have 
a natural parameterized problem for which there is no polynomial kernel but we can "cheat" this 
lower bound by providing |/|'^'^^^ independent polynomial kernels. Besides of theoretical interest, 
this type of results would be very desirable from a practical point of view as well. In this paper, 
we address the issue of many-to-one kernelization versus Turing kernelization through /c-Leaf 
Out-Branching. 

The Maximum Leaf Spanning Tree problem on connected undirected graphs is to find a 
spanning tree with the maximum number of leaves in a given input graph G. The problem is well 
studied both from an algorithmic |16^ [23l [Mj [T3j and combinatorial [10^ [T7\ [20} [22] point of view. 
The problem has been studied from the parameterized complexity perspective as well [5| 1121 
An extension of Maximum Leaf Spanning Tree to directed graphs is defined as follows. We 
say that a subdigraph T of a digraph D is an out-tree if T is an oriented tree with only one vertex 
r of in-degree zero (called the root). The vertices of T of out-degree zero are called leaves. If T is 
a spanning out-tree, i.e. V{T) = V{D), then T is called an out-branching of D. The Directed 
Maximum Leaf Out-Branching problem is to find an out-branching in a given digraph with 
the maximum number of leaves. The parameterized version of the Directed Maximum Leaf 
Out-Branching problem is /c-Leaf Out-Branching, where one for a given digraph D and 
integer k is asked to decide whether D has an out-branching with at least k leaves. If we replace 
out-branching with out-tree in the definition of /c-Leaf Out-Branching we get the problem of 
fe-LEAF Out-Tree. 

Unlike its undirected counterpart, the study of /c-Leaf Out-Branching has only begun re- 
cently. Alon et al. [H [2] proved that the problem is fixed parameter tractable (FPT) by providing 
an algorithm deciding in time 0{f{k)n) whether a strongly connected digraph has an out-branching 
with at least k leaves. Bonsma and Dorn [6] extended this result to connected digraphs, and im- 
proved the running time of the algorithm. Very recently, Kneis et al. [21] provided parameterized 
algorithm solving the problem in time 4^n'-^^^\ In a related work Drescher and Vetta [H] described 
an a/OPT- approximation algorithm for the Directed Maximum Leaf Out-Branching prob- 
lem. Let us remark, that despite of similarities of directed and undirected variants of Maximum 
Leaf Spanning Tree, the directed case requires a totally different approach. The existence of a 
polynomial kernel for /c-Leaf Out-Branching has not been addressed until now. 

Our contribution. We prove that Rooted A;-Leaf Out-Branching, where for a given vertex 
r one asks for k-leaf out-branching rooted at r, admits a polynomial, in fact a 0{k'^), kernel. A 
similar result also holds for Rooted /c-Leaf Out-Tree, where we are looking for a rooted (not 
necessary spanning) tree with k leaves. While many polynomial kernels are known for undirected 
graphs, this is the first, to our knowledge, non-trivial parameterized problem on digraphs admitting 
a polynomial kernel. To obtain the kernel we establish a number of results on the structure of 
digraphs not having a k-leaf out-branching. These results may be of independent interest. 

In light of our positive results it is natural to suggest that A;-Leaf Out-Branching admits 
polynomial kernel as well. We find it a bit striking that this is not the case. We establish kernel- 
ization lower bounds by proving that unless PH = S^, there is no polynomial kernel for neither 
A;-Leaf Out-Branching nor ^-Leaf Out-Tree. While the main idea of our proof is based 
on the framework of Bodlaender et al. [1], our adaptation is non-trivial. In particular, we use 
the cubic kernel obtained for Rooted fc-LEAF Out-Branching to prove the lower bound. Our 
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Table 1: Our Results 

contributions are summarized in Tabled) 

Finally, let us remark that the polynomial kernels for the rooted versions of our problems 
provide a "cheat" solution for the poly-kernel-intractable /c-Leaf Out-Branching and /c-Leaf 
Out-Tree. Indeed, let D be a digraph on n vertices. By running the kernelization for the rooted 
version of the problem for every vertex of D as a root, we obtain n graphs where each of them has 
0{k^) vertices, such that at least one of them has a fc-leaf out-branching if and only if D does. 

2 Preliminaries 

Let -D be a directed graph or digraph for short. By V{D) and A{D) we represent vertex set and arc 
set respectively of D. Given a subset V C V{D) of a digraph D, by we mean the digraph 

induced on V' . A vertex ?/ of D is an in-neighbor {out-neighbor) of a vertex x ii yx £ A {xy G ^4). 
The in-degree (out-degree) of a vertex x is the number of its in-neighbors (out-neighbors) in D. 
Let P = piP2 ■ ■ - Pi be a given path. Then by P[piPj] we denote a subpath of P starting at vertex 
Pi and ending at vertex pj. For a given vertex q G V{D), by g-out-branching (or q-out-tree) we 
denote an out-branching (out-tree) of D rooted at vertex q. 

We say that the removal of an arc uv (or a vertex set S) disconnects a vertex w from the 
root r if every path from r to w in D contains arc uv (or one of the vertices in S). An arc uv is 
contracted as follows, add a new vertex u' , and for each arc wv or wu add the arc wu' and for an 
arc vw or uw add the arc u'w, remove all arcs incident to u and v and the vertices u and v. We 
say that a reduction rule is safe for a value k if whenever the rule is applied to an instance (D, k) 
to obtain an instance (D' , k'), D has an r-out-branching with at least k leaves if and only if D' has 
an r-out-branching with at least k' leaves. We also need the following. 

Proposition 2.1 Let D be a digraph and r be a vertex from which every vertex in V{D) is 
reachable. Then if we have an out-tree rooted at r with k leaves then we also have an out-branching 
rooted at r with k leaves. 

Let T be an out-tree of a digraph D. We say that u is a parent of v and f is a child of u if 
uv G A(T). We say that u is an ancestor of v if there is a directed path from u to f in T. An arc 
uv in A{D) \ A{T) is called a forward arc if u is an ancestor of v, a backward arc if v is an ancestor 
of u and a cross arc otherwise. Finally, parameterized decision problems are defined by specifying 
the input (/), the parameter (/c), and the question to be answered. A parameterized problem that 
can be solved in time /(A;)|/|'^(^) where / is a function of k alone is said to be fixed parameter 
tractable (FPT). 

3 Reduction Rules for RoOTED /c-Leaf Out-Branching 

In this section we give all the data reduction rules we apply on the given instance of Rooted 
/c-Leaf Out-Branching to shrink its size. 

Reduction Rule 1 [Reachability Rule] If there exists a vertex u which is disconnected from the 
root r, then return No. 
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Figure 1: An Illustration of Reduction Rule El 

For the Rooted /c-Leaf Out- Tree problem the Rule [T] translates into following: If a vertex u is 
disconnected from the root r, then remove u and all in- arcs to u and out-arcs from u. 

Reduction Rule 2 [Useless arc Rule] If vertex u disconnects a vertex v from the root r, then 
remove the arc vu. 

Lemma 3.1 Reduction Rules\^ andlE are safe. 

Reduction Rule 3 [Bridge Rule] // an arc uv disconnects at least two vertices from the root r, 
contract arc uv. 

Lemma 3.2 [★] Reduction Rule\Eis safe. 

Reduction Rule 4 [Avoidable Arc Rule] If a vertex set S, \S\ < 2, disconnects a vertex v from 
the root r, vw G A{D) and xw G ^(D) for all x £ S, then delete the arc vw. 

Lemma 3.3 [*] Reduction Rule\^is safe. 

Reduction Rule 5 [Two directional path Rule] If there is a path P = piP2 ■ • .pi-ipi with I = 7 
or I = 8 such that 

• pi and Pin G {pi-i,Pi} are the only vertices with in- arcs from the outside of P. 

• pi and Pout G {pi)P2} are the only vertices with out-arcs to the outside of P. 

• The path P is the unique out-branching of D[V{P)] rooted at pi. 

• There is a path Q that is the unique out-branching of D[V{P)] rooted at pin. 

• The vertex after pout on P is not the same as the vertex after pi on Q. 

Then delete R = P \ {pi,Pin,Pout,Pi} and all arcs incident to these vertices from D. Add two 
vertices u and v and the arc set {poutUjUVjVpmjPiVjVUjUpi} to D. 

Notice that every vertex on P has in-degree at most 2 and out-degree at most 2. Figure [1] gives 
an example of an application of Reduction Rule [5l 

Lemma 3.4 Reduction Rule\Mis safe. 

^Proofs of the results labeled with [-*] have been moved to appendix due to space restrictions. 
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Proof. Let D' be the graph obtained by performing Reduction Rule [S] to a path P in D. Let 
Pu be the path piPoutuvpmPi and Qy be the path PinPivupiPout- Notice that Pu is the unique 
out-branching of D'\V{Pu)\ rooted at pi and that Qy is the unique out-branching of D'\V{Pu)\ 
rooted at pin- 

Let T be an r-out-branching of D with at least k leaves. Notice that since P is the unique 
out-branching of -D[y(P)] rooted at pi, Q is the unique out-branching of -D[y(-P)] rooted at pm 
and pi and pin are the only vertices with in-arcs from the outside of P, T[y(P)] is either a path or 
the union of two vertex disjoint paths. Thus, T has at most two leaves in V{P) and at least one 
of the following three cases must apply. 

1. T[y(P)] is the path P from pi to pi. 

2. T\V{P)] is the path Q from pin to Pout- 

3. r[y(P)] is the vertex disjoint union of a path P that is a subpath of P rooted at pi, and a 
path Q that is a subpath of Q rooted at p^. 

In the first case we can replace the path P in T by the path P^ to get an r-out-branching of 
D' with at least k leaves. Similarly, in the second case, we can replace the path Q in T by the 
path Qy to get an r-out-branching of D' with at least k leaves. For the third case, observe that P 
must contain po^t since pout = Pi or pi appears before pout on Q and thus, pout can only be reached 
from pi. Similarly, Q must contain pi. Thus, T \ i? is an r-out-branching oi D\R. We build an 
r-out-branching T' of D' by taking T\R and letting u be the child of pout and v be the child of pi. 
In this case T and T' have same number of leaves outside of V{P) and T has at most two leaves 
in V{P) while both u and v are leaves in T' . Hence T' has at least k leaves. 

The proof for the reverse direction is similar and can be found in Appendix E ■ 

We say that a digraph D is a reduced instance of Rooted /c-Leaf Out-Branching if none of 
the reduction rules (Rules 1-5) can be applied to D. It is easy to observe from the description of 
the reduction rules that we can apply them in polynomial time, resulting in the following lemma. 

Lemma 3.5 For a digraph D on n vertices we can obtain a reduced instance D' in polynomial 
time. 

4 Polynomial Kernel: Bounding a Reduced No-instance 

In this section we show that any reduced no-instance of Rooted /c-Leaf Out-Branching must 
have at most O(fe^) vertices. In order to do so we start with T, a BFS-tree rooted at r, of a reduced 
instance D and look at a path P oi T such that every vertex on P has out-degree one in T. 

We bound the number of endpoints of arcs with one endpoint in P and one endpoint outside 
of P (Section 14. We then use these results to bound the size of any maximal path with every 
vertex having out-degree one in T (Section 14. 2p . Finally, we combine these results to bound the 
size of any reduced no-instance of Rooted A;-Leaf Out-Branching by 0(P). 

4.1 Bounding the Number of Entry and Exit Points of a Path 

Let -D be a reduced no-instance, and T be a BFS-tree rooted at r. The BFS tree T has at most 
k—1 leaves and hence at most k — 2 vertices with out-degree at least 2 in T. Now, let P = piP2 ■ ■ - Pi 
be a path in T such that all vertices in V{P) have out-degree 1 in T (P does not need to be a 
maximal path of T). Let Ti be the subtree of T induced by the vertices reachable from r in T 
without using vertices in P and let T2 be the subtree of T rooted at the child r2 of pi in T. Since 
T is a BFS-tree, it does not have any forward arcs, and thus pir2 is the only arc from P to T2. 
Thus all arcs originating in P and ending outside of P must have their endpoint in Ti . 



5 



Lemma 4.1 Let D he a reduced instance, T he a BFS-tree rooted at r, and P = PiP2---Pi be 
a path in T such that all vertices in V{P) have out-degree 1 in T . Let upi G -^iD), for some i 
between 1 and I, he an arc with u ^ P. There is a path P^p. from r to pi using the arc upi, such 
thatV{PupJnV{P)Q{pi,pi}. 

Proof. Let Ti be the subtree of T induced by the vertices reachable from r in T without using 
vertices in P and let T2 be the subtree of T rooted at the child r2 of pi in T. If u € V{Ti) there is 
a path from r to n avoiding P. Appending the arc upi to this path yields the desired path P^p. , so 
assume u G V(T2). If all paths from r to u use the arc pi-iPi then pi-ipi is an arc disconnecting 
pi and r2 from r, contradicting that Reduction Rule [3] can not be applied. Let P' be a path from 
r to u not using the arc pi^iPi- Let x be the last vertex from Ti visited by P' . Since P' avoids 
Pi~iPi we know that P' does not visit any vertices of P \ {pi} after x. We obtain the desired path 
Pupi by taking the path from r to x in Ti followed by the subpath of P' from x to u appended by 
the arc upi. ■ 

Corollary 4.2 Let D be a reduced no-instance, T he a BFS-tree rooted at r and P = piP2 ■ ■ - Pi be 
a path in T such that all vertices in V{P) have out- degree 1 m T. There are at most k vertices in 
P that are endpoints of arcs originating outside of P. 

Proof. Let S be the set of vertices in P \ {pi} that are endpoints of arcs originating outside of P. 
For the sake of contradiction suppose that there are at least /c + 1 vertices in P that are endpoints 
of arcs originating outside of P. Then \S\ > k. By Lemma |4. II there exists a path from the root r 
to every vertex in S, that avoids vertices of P \ {pi} as an intermediate vertex. Using these paths 
we can build an r-out-tree with every vertex in 5* as a leaf. This r-out-tree can be extended to a 
r-out-branching with at least k leaves by Proposition 12. H contradicting that D is a no-instance. ■ 

Lemma 4.3 Let D be a reduced no-instance, T he a BFS-tree rooted at r and P = piP2 ■ ■ - Pi he a 
path in T such that all vertices in V{P) have out-degree 1 in T. There are at most 7{k — 1) vertices 
outside of P that are endpoints of arcs originating in P. 

Proof. Let X be the set of vertices outside P which are out-neighbors of the vertices on P. Let P' 
be the path from r to pi in T and r2 be the unique child of pi in T. First, observe that since there 
are no forward arcs, r2 is the only out-neighbor of vertices in V{P) in the subtree of T rooted at 
r2- In order to bound the size of X, we differentiate between two kinds of out-neighbors of vertices 
on P. 

• Out-neighbors of P that are not in V{P'). 

• Out-neighbors of P in V{P'). 

First, observe that \X \ V{P')\ < k — 1. Otherwise we could have made an r-out-tree with at least 
k leaves by taking the path P'P and adding X \ V{P') as leaves with parents in V{P). 

In the rest of the proof we bound \X f] V{P')\. Let Y be the set of vertices on P' with out- 
degree at least 2 in T and let Pi, P2, . . . , Pt be the remaining subpaths of P' when vertices in Y 
are removed. For every i < t, Pi = ViiVi2 ■ ■ ■ Viq. We define the vertex set Z to contain the two last 
vertices of each path Pj. The number of vertices with out-degree at least 2 in T is upper bounded 
by A; — 2 as T has at most k — 1 leaves. Hence, \Y\ <k — 2,t<k — 1 and \Z\ < 2{k — 1). 

Claim 1 For every path Pi = ViiVi2 ■ ■ ■ Vig, 1 < i < t, there is either an arc UiViq-i or UiViq where 



6 



To see the claim observe that the removal of arc Viq-2Viq-i does not disconnect the root r from 
both Viq^i and Viq else Rule [3] would have been applicable to our reduced instance. For brevity 
assume that Viq is reachable from r after the removal of arc Viq-2Viq-i. Hence there exists a path 
from r to Viq. Let UiViq be the last arc of this path. The fact that the BFS tree T does not have 
any forward arcs implies that Ui ^ V{Pi). 

To every path Pi = ViiVi2 ■ ■ ■ Viq, 1 < i < t, we associate an interval li = ViiVi2 ■ ■ ■ Viq-2 and 
an arc UiViq', q' € {q — !,<?}• This arc exists by Claim [TJ Claim [1] and Lemma |4. II together imply 
that for every path Pj there is a path Pri from the root r to Viq' that does not use any vertex in 
V{Pi) \ {viq-i,Viq} as an intermediate vertex. That is, V{Pri n {V{Pi) \ {viq-i,Viq}) = 0. 

Let P/j be a subpath of P^ starting at a vertex Xj before vn on P' and ending in a vertex yi 
after Viq-2 on P' . We say that a path P^^ covers a vertex x if x is on the subpath of P' between 
Xi and Ui and we say that it covers an interval Ij if Xi appears before vji on the path P' and Ui 
appears after Vjq-2 on P'. Observe that the path covers the interval Ij. 

Let V = {P[,P2, . . . , P/} ^ {P'ri^ • • • ) P^t} be a minimum collection of paths, such that every 
interval li, 1 < i < t, is covered by at least one of the paths in V. Furthermore, let the paths of V 
be numbered by the appearance of their first vertex on P'. The minimality of V implies that for 
every P/ E V there is an interval I'^ E {/i, . . . ,It} such that P/ is the only path in V that covers 

n- 

Claim 2 For every 1 <i <l, no vertex of P' is covered by both P- and P/_|_3. 

The path P/^^ is the only path in V that covers the interval I'^j^^ and hence P/ does not cover the 
last vertex of Similarly P/_^2 only path in V that covers the interval I[j^2 ^'^d hence 

P/+3 does not cover the first vertex of /j'_(_2- Thus the set of vertices covered by both P/ and P/_|_3 
is empty. 

Since paths P/ and P/+3 do not cover a common vertex, we have that the end vertex of P/ 
appears before the start vertex of P/_|_3 on P' or is the same as the start vertex of P/^3. Partition 
the paths of V into three sets T'o,T'i,T'2, where path P/ E Vimods- Also let Tj be the set of intervals 
covered by Vi- Observe that every interval Ij, 1 < j < t, is part of some 2j for i E {0, 1,2}. 

Let i < 3 and consider an interval Ij E Xj. There is a path Pj/ E Vi that covers Ij such that 
both endpoints of Pj' and none of the inner vertices of Pj' lie on P'. Furthermore for any pair of 
paths Pa, Pb E Vi such that a < b, there is a subpath in P' from the endpoint of Pa to the starting 
point of Pb- Thus for every i < 3 there is a path P* from the root r to pi which does not use any 
vertex of the intervals covered by the paths in Vi . 

We now claim that the total number of vertices on intervals Ij, 1 < j < t, which are out- 
neighbors of vertices on V{P) is bounded by 3(A; — 1). If not, then for some i, the number of 
out-neighbors in Tj is at least k. Now we can make an r-out-tree with k leaves by taking any 
r-out-tree in D[V{P*) U ^(P)] and adding the out-neighbors of the vertices on V{P) in 2j as leaves 
with parents in V{P). 

Summing up the obtained upper bounds yields \X\ < {k — 1) + \{r2}\ + \Y\ + \Z\ + 3{k — 1) < 
(A: - 1) + 1 + (A; - 2) + 2(A; - 1) + 3{k - 1) = 7{k - 1), concluding the proof. ■ 

Remark: Observe that the path P used in Lemmas 14.11 and 14.31 and Corollary 14.21 need not be a 
maximal path in T with its vertices having out-degree one in T. 

4.2 Bounding the Length of a Path: On Paths through Nice Forests 

For a reduced instance D, a BFS tree T oi D rooted at r, let P = piP2 ■ ■ - Pi be a path in T such 
that all vertices in V{P) have out-degree 1 in T, and let S be the set of vertices in V{P) \ {pi} 
with an in-arc from the outside of P. 
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Definition 4.4 A subforest F = {V{P),A{F)) of D[V{P)] is said to be nice forest of P if the 
following three properties are satisfied: (a) F is a forest of directed trees rooted at vertices in S; 

(b) If PiPj G ^iP) o-n-d i < j then pi has out-degree at least 2 in F or pj has in-degree 1 in D; and 

(c) If PiPj £ A{F) and i > j then for all q > i, PqPj ^ A{D). 

In order to bound the size a reduced no-instance D we are going to consider a nice forest with 
the maximum number of leaves. However, in order to do this, we first need to show the existence 
of a nice forest of P. 

In the fohowing discussion let D be a reduced no-instance, T be a BFS tree T oi D rooted at 
r, P = piP2 ■ ■ - Pi be a path in T such that all vertices in V{P) have out-degree 1 in T and 5 be 
the set of vertices in V{P) \ {pi} with an in-arc from the outside of P. 

Lemma 4.5 [★] There is a nice forest in P. 

For a nice forest F of P, we define the set of key vertices of F to be the set of vertices in S, 
the leaves of F, the vertices of F with out-degree at least 2 and the set of vertices whose parent in 
F has out-degree at least 2. 

Lemma 4.6 [★] Let F be a nice forest of P. There are at most 5{k — 1) key vertices of F. 

We can now turn our attention to a nice forest F of P with the maximum number of leaves. 
Our goal is to show that if the key points of F are to spaced out on P then some of our reduction 
rules must apply. First, however, we need some more observations about the interplay between P 
and F. 

Observation 4.7 [Unique Path] For any two vertices pi, pj in V{P) such that i < j, PiPi+i ■ ■ - Pj 
is the only path from pi to pj in D[V{P)]. 

Proof. As T is a BFS-tree it has no forward arcs. So any vertex set X = {pi,P2, ■ ■ ■ ,Pq} with 
q < \V{P)\, the arc PqPq+i is the only arc in D from a vertex in X to a vertex in V{P) \ X. m 

Corollary 4.8 [*] No arc PiPi+i is a forward arc of F. 

Observation 4.9 Let ptpj be an arc in A{F) such that neither pt nor pj are key vertices, and 
* e {j - 1, j + 1, • • • , 0- Then for all q > t, pgPj A{D). 

Observation 14.91 follows directly from the definitions of a nice forest and key vertices. 

Observation 4.10 [*] If neither pi nor p^+i are key vertices, then either piPi+i ^ A{F) or 
Pi+iPi+2 i A{F). 

In the following discussion let F be a nice forest of P with the maximum number of leaves and 
let P' = PxPx+i . . .Pj; be a subpath of P containing no key vertices, and additionally having the 
property that Px^iPx ^ A{F) and PyPy+i ^ A{F). 

Lemma 4.11 [*] V{P') induces a directed path in F . 

In the following discussion let Q' be the directed path 

Observation 4.12 [★] For any pair of vertices Pi,Pj G V{P') if i < j — 2 then pj appears before 
Pi in Q'. 
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Lemma 4.13 [*] All arcs of D[V{P')] are contained in A{P') UA{F). 



Lemma 4.14 If \P'\ > 3 there are exactly 2 vertices in P' that are endpoints of arcs starting 
outside of P' . 

Proof. By Observation 14.71 Px-iPx is the only arc between {pi,P2, ■ ■ ■ ,Px-i} and P'. By Lemma 
14.111 i<'[y(P')] is a directed path Q'. Let pq be the first vertex on Q' and notice that the parent 
of Pq in F is outside of V{P'). Observation 14.12] imphes that q > y — 1. Hence pq and px are two 
distinct vertices that are endpoints of arcs starting outside of P' . It remains to prove that they 
are the only such vertices. Let pi be any vertex in P' \ {px^Pq}- By Lemma 14.111 V(P') induces 
a directed path Q' in F, and since pq is the first vertex of Q', the parent of pi in F is in V{P'). 
Observation 14.91 vields then that ptPi ^ A{D) for any t > y. ■ 

Observation 4.15 [★] Let Q' = F[V{P')]. For any pair of vertices u,v such that there is a path 
Q'[uv] from u to v in Q' , Q'[uv] is the unique path from u to v in D[V{P')]. 

Lemma 4.16 For any vertex x ^ V{P') there are at most 2 vertices in P' with arcs to x. 

Proof. Suppose there are 3 vertices PaiPb^Pc in V{P') such that a < b < c and such that 
PaX,pbX,pcX £ ^{D)- By Lemma [4.111 = F[V{P')] is a directed path. If Pa appears before 
Ph in Q' then Observation 14.12] implies that a + 1 = b and that ph has in-degree 1 in L>. Then pa 
separates pb from the root and hence Rule H] can be applied to remove the arc pbX contradicting 
that £) is a reduced instance. Hence pb appears before pa in Q'. By an identical argument pc 
appears before pb in Q'. 

Let Pb be a path in D from the root to pb and let u be the last vertex in Pb outside of V{P'). 
Let V be the vertex in Pb after u. By Lemma |4.14| u is either px or the first vertex pq of Q'. If 
u = Px then Observation 14.71 implies that Pb contains pa, whereas u = pq then Observation 14.15] 
implies that Pb contains pc- Thus the set {pa,Pc} separates pb from the root and hence Rule H] can 
be applied to remove the arc pbX contradicting that D is a reduced instance. ■ 

Corollary 4.17 [★] There are at most 14{k — 1) vertices in P' with out-neighbors outside of P' . 
Lemma 4.18 < 154(A; - 1) + 10. 

Proof. Assume for contradiction that \P'\ > 154(/i; — 1) + 10 and let X be the set of vertices in P' 
with arcs to vertices outside of P'. By Corollary 14.171 1^1 ^ 14(/c — 1). Hence there is a subpath 
of P' on at least 154(A; — 1) + 10/(14(A; — 1) + 1) = 9 vertices containing no vertices of X. By 
Observation 14.101 there is a subpath P" = paPa+i ... of -P' on 7 or 8 vertices such that neither 
Pa-iPa nor PbPb+i are arcs of F. By Lemma [4.111 F\y{P"']\ is a directed path Q" . Let pq and pt 
be the first and last vertices of Q" respectively. By Lemma 14.141 pa and pq are the only vertices 
with in-arcs from outside of P" . By Observation 14.12] pq G {pb-i^Pb} and pt E {pa-,Pa+i}- By 
the choice of P" no vertex of P" has an arc to a vertex outside of P' . Furthermore, since P" is a 
subpath of P' and Q" is a subpath of Q' Lemma 14.131 implies that pb and pt are the only vertices 
of P' with out- arcs to the outside of P" . By Lemma |4.7| the path P" is the unique out-branching 
of D\V{P")] rooted at pa- By Lemma l4.15t the path Q" is the unique out-branching of D\y{P")] 
rooted at pq. By Observation 14.121 ph_9 appears before Pa+2 in Q" and hence the vertex after pb in 
Q" and pt+i is not the same vertex. Thus Rule [5] can be applied on P" , contradicting that D is a 
reduced instance. ■ 

Lemma 4.19 [k] Let D be a reduced no-instance to RoOTED fc-LEAF Out-Branching. Then 
\V{D)\=0{k^). 
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Lemma ITT^ results in cubic kernel for RoOTED /c-Leaf Out-Branching as follows. 



Theorem 4.20 Rooted /c-Leaf Out-Branching and Rooted A;-Leaf Out-Tree admits a 
kernel of size 0{k^). 

Proof. Let D be the reduced instance of Rooted /c-Leaf Out-Branghing obtained in polyno- 
mial time using Lemma 13.51 If the size of D is more than 1540/;;^ then return Yes. Else we have 
an instance of size bounded by 0{k^). The correctness of this step follows from Lemma |4. 191 which 
shows that any reduced no-instance to RoOTED /c-Leaf Out-Branghing has size bounded by 
0{k^). The result for Rooted A;-Leaf Out-Tree follows similarly. ■ 

5 Kernelization Lower Bounds 

In the last section we gave a cubic kernel for Rooted A;-Leaf Out-Branghing, it is natural to 
ask whether the closely related /c-Leaf Out-Branghing has a polynomial kernel. The answer to 
this question, somewhat surprisingly, is no, unless an unlikely collapse of complexity classes occurs. 
To show this we utilize a recent result of Bodlaender et al. [3] that states that any compositional 
parameterized problem does not have a polynomial kernel unless the polynomial hierarchy collapses 
to the third level. 

Definition 5.1 (Composition [4]) A composition algorithm for a parameterized problem L C 
S* X N is an algorithm that 

• receives as input a sequence ((xi, k), . . . , (xt, k)), with (xj. A:) G S* x N"*" for each 1 <i <t, 

• uses time polynomial in 'Y^^^i \ xi\ + k, 

• and outputs {y, k') € S* x N"*" with 

1. {y, k') G L <^=^ (xj, k) £ L for some 1 < i < t. 

2. k' is polynomial in k. 

A parameterized problem is compositional if there is a composition algorithm for it. 
Now we state the main result of [1] which we need for our purpose. 

Theorem 5.2 ([4]) Let L be a compositional parameterized language whose unparameterized ver- 
sion L is NP-complete then unless PH=S|, there is no polynomial kernel for L. 

Theorem 5.3 /c-Leaf Out-Tree has no polynomial kernel unless PH=Sp. 

Proof. The problem is NP-complete [1]. We prove that it is compositional and thus. Theorem 15.21 
will imply the statement of the theorem. A simple composition algorithm for this problem is as 
follows. On input (Di, k), {D2, k), . . . , {Dt, k) output the instance {D, k) where D is the disjoint 
union Di, . . . , Df. Since an out-tree must be completely contained in a connected component of 
the underlying undirected graph of D, {D,k) is a yes instance to /c-Leaf Out-Tree if and only 
if any out of fc), 1 < i < t, is. This concludes the proof. ■ 

A willow graph [11], D = {V,Ai U A2) is a directed graph such that D' = {V,Ai) is a directed 
path P = P1P2 ■ ■ - Pn on all vertices of D and D" = {V,A2) is a directed acyclic graph with one 
vertex r of in-degree 0, such that every arc of A2 is a backwards arc of P. pi is called the bottom 
vertex of the willow, p„ is called the top of the willow and P is called the stem. A nice willow 
graph D = iV^Ai U A2) is a willow graph where pnPn-i and pnPn-2 are arcs of D, neither Pn-i 
nor Pn-2 are incident to any other arcs of A2 and D" = (F, ^2) has a ^j^-out-branching. 
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Observation 5.4 [★] Let D = {V,Ai U A2) be a nice willow graph. Every out-branching of D with 
the maximum number of leaves is rooted at the top vertex p„. 

Lemma 5.5 [*] /c-Leaf Out-Tree in nice willow graphs is NP-complete under Karp reductions. 

Theorem 5.6 A;-Leaf Out-Branching has no polynomial kernel unless PH=5]p. 

Proof. We prove that if /c-Leaf Out-Branching has a polynomial kernel then so does /c-Leaf 
Out-Tree. Let (D, k) be an instance to /c-Leaf Out-Tree. For every vertex u G 1/ we make 
an instance {D,v,k) to RoOTED fe-LEAF Out-Tree. Clearly, {D,k) is a yes instance for /c-Leaf 
Out-Tree if and only if {D,v,k) is a yes instance to Rooted /c-Leaf Out-Tree for some 
V £ V. By Theorem 14.201 RoOTED /c-Leaf Out-Tree has a 0(/c^) kernel, so we can apply the 
kernelization algorithm for RoOTED /c-Leaf Out-Tree separately on each of the n instances 
of Rooted /c-Leaf Out-Tree to get n instances {Di,vi,k), {D2,v2,k), {Dn,Vn,k) with 
= 0{k^) for each i < n. By Lemma [531 /c-Leaf Out-Branching in nice willow graphs 
is NP-complete under Karp reductions so we can reduce each instance {Di,Vi,k) of Rooted k- 
Leaf Out-Tree to an instance {Wi,bi) of /c-Leaf Out-Branching in nice willow graphs in 
polynomial time in \Di\, and hence in polynomial time in k. Thus, in each such instance, bi < k^ 
for some fixed, consta^nt c independent of botli n and k. Let bfnax 

= maxj<„6i. Without loss of 

generality bi = bmax for every i. This assumption is safe because if it does not hold we can modify 
the instance {Wi, bi) by replacing bi with bmax, subdividing the last arc of the stem bmax — h times 
and adding an edge from rj to each subdivision vertex. 

From the instances (Wi,bmax), ■ ■ iWn,bmax) we build an instance {D',bmax + 1) to /c-Leaf 
Out-Branching. Let rj and Sj be the top and bottom vertices of Wi respectively. We build D' 
simply by taking the disjoint union of the willows graphs Wi, W2, ■ ■ ■ , Wn and adding in an arc 
rjSj+i for i < n and the arc r„si. Let C be the directed cycle in D obtained by taking the stem of 
D' and adding the arc r„si. 

If for any i < n, Wi has an out-branching with at least bmax leaves, then Wi has an out- 
branching rooted at with at least bmax leaves. We can extend this to an out-branching of D' 
with at least bmax + 1 leaves by following C from rj. In the other direction suppose D' has an 
out-branching T with at least bmax + 1 leaves. Let i be the integer such that the root r of T is in 
V^(Wi). For any vertex v in V{D') outside of V{Wi), the only path from r to in D' is the directed 
path from r to v in C. Hence T has at most 1 leaf outside of V{Wi). Thus T[l/(VFi)] contains an 
out-tree with at least bmax leaves. 

By assumption, A;-Leaf Out-Branching has a polynomial kernel. Hence we can apply a 
kernelization algorithm to get an instance {D",k") of /c-Leaf Out-Branching with |y(Z)")| < 
^max for a constant C2 independent of n and bmax such that {D" , k") is a yes instance if and only 
if {D',bmax) is- 

Finally, since /c-Leaf Out-Tree is NP-complete we can reduce {D", k") to an instance {D*, k*) 
of A;-Leaf Out-Tree in polynomial time. Hence k* < \V{D*)\ < \V{D")\'^^ < k'^* for some fixed 
constants C3 and C4. Hence we conclude that if /c-Leaf Out-Branching has a polynomial kernel 
then so does A;-Leaf Out- Tree. Thus, Theorem 15.31 implies that /c-Leaf Out-Branching has 
no polynomial kernel unless PH=Sp. ■ 

6 Conclusion and Discussions 

In this paper we demonstrate that Turing kernelization is a more poweful technique than many- 
to-one kernelization. We showed that while /c-Leaf Out-Branching and /c-Leaf Out-Tree do 
not have a polynomial kernel unless an unlikely collapse of complexity classes occurs, they do have 
n independent cubic kernels. Our paper raises far more questions than it answers. We believe that 
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there are many more problems waiting to be addressed from the viewpoint of Turing kernehzation. 
A few concrete open problems in this direction are as follows. 

• Is there a framework to rule out the possibility of |/|'^'^^) polynomial kernels similar to the 
framework developed in [3]? 

• Which other problems admit a Turing kernehzation like the cubic kernels for /c-Leaf Out- 
Branching and /c-Leaf Out-Tree obtained here? 

• Does there exist a problem for which we do not have a linear many-to-one kernel, but does 
have linear kernels from the viewpoint of Turing kernehzation? 
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A Proofs moved from Section [3] 



Lemma 3.1 Reduction RulesUl andlM are safe. 

Proof. If there exists a vertex which can not be reached from the root r then a digraph can not 
have any r-out-branching. For Reduction RuleO all paths from r to v contain the vertex u and 
thus the arc vu is a back arc in any r-out-branching of D. m 

Lemma 3.2 Reduction Rulel3\is safe. 

Proof. Let the arc uv disconnect at least two vertices v and w from r and let D' be the digraph 
obtained from D by contracting the arc uv. Let T be an r-out-branching of D with at least k 
leaves. Since every path from r to w contains the arc uv, T contains uv as well and neither u nor 
V are leaves of T. Let T' be the tree obtained from T by contracting uv. T' is an r-out-branching 
of D' with at least k leaves. 

In the opposite direction, let T' be an r-out-branching of D' with at least k leaves. Let u' be 
the vertex in D' obtained by contracting the arc uv, and let x be the parent of u' in T' . Notice 
that the arc xu' in T' was initially the arc xu before the contraction of uv, since there is no path 
from r to V avoiding u in D. We make an r-out-branching T oi D from T' , by replacing the vertex 
u' by the vertices u and v and adding the arcs xu, uv and arc sets {vy : u'y G A[T') Avy G A{D)} 
and {uy : u'y G A{T') Avy ^ A{D)}. All these arcs belong to A{D) because all out-neighbors of 
u' in D' are out-neighbors either of u or oi v in D. Finally u' must be an inner vertex of T' since 
u' disconnects w from r. Hence T has at least as many leaves as T'. ■ 

Lemma 3.3 Reduction Rule\^is safe. 

Proof. Let D' be the graph obtained by removing the arc vw from D and let T be an r-out- 
branching of D. If vw ^ ^iT), T is an r-out-branching of D' , so suppose vw S A{T). Any 
r-out-branching of D contains the vertex v, and since all paths from r to v contain some vertex 
X G S, some vertex n G S" is an ancestor of v in T. Let T' = (T U uw) \ vw. T' is an out-branching 
of D' . Furthermore, since u is an ancestor of v in T, T' has at least as many leaves as T. For the 
opposite direction observe that any r-out-branching of D' is also an r-out-branching oi D. ■ 

Lemma 3.4 Reduction Rule\^is safe. 

Proof. Let D' be the graph obtained by performing Reduction Rule [5] to a path P m. D. Let 
Pu be the path piPoutuvPinPi and be the path PinPivupiPout- Notice that P^ is the unique 
out-branching of D'\y{Pu)\ rooted at pi and that is the unique out-branching of D'\y{Pu)\ 
rooted at pin. 

Let T be an r-out-branching of D with at least k leaves. Notice that since P is the unique 
out-branching of D\y{Py\ rooted at pi, Q is the unique out-branching of D[V{P)] rooted at pin 
and pi and pin are the only vertices with in-arcs from the outside of P, T[V{P)] is either a path or 
the union of two vertex disjoint paths. Thus, T has at most two leaves in V{P) and at least one 
of the following three cases must apply. 

1. r[y(P)] is the path P from pi to pi. 

2. T[y(P)] is the path Q from pin to Pout- 

3. r[y(P)] is the vertex disjoint union of a path P that is a subpath of P rooted at pi, and a 
path Q that is a subpcith. of rootGd. at pin- 
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In the first case we can replace the path P in T by the path P„ to get an r-out-branching of 
D' with at least k leaves. Similarly, in the second case, we can replace the path Q in T by the 
path to get an r-out-branching of D' with at least k leaves. For the third case, observe that P 
must contain pout since pout = Pi or pi appears before pout on Q and thus, pout can only be reached 
from pi. Similarly, Q must contain pi. Thus, T \ is an r-out-branching oi D\ R. We build an 
r-out-branching T' of D' by taking T\R and letting u be the child of pout and v be the child of pi. 
In this case T and T' have same number of leaves outside of V{P) and T has at most two leaves 
in V{P) while both u and v are leaves in T' . Hence T' has at least k leaves. 

In the other direction let T' be an r-out-branching of D' with at least k leaves. Notice that 
since Pu is the unique out-branching of D'[V{Pu)] rooted at pi, is the unique out-branching 
of D'[V{Pu)] rooted at Pin and pi and pin are the only vertices with in-arcs from the outside of 
V{Pu), T'[V{Pu)] is either a path or the union of two vertex disjoint paths. Thus, T' has at most 
two leaves in V{Pu) and at least one of the following three cases must apply. 

1. T'[V{Pu)] is the path Pu from pi to pi. 

2. T'[V{Pu)] is the path (5„ from pin to pout- 

3. T'[V{Pu)] is the vertex disjoint union of a path P^ that is a subpath of Pu rooted at pi, and 
a path Qu that is a subpath of Qu rooted at pm- 

In the first case we can replace the path Pu in T' by the path P to get an r-out-branching of D 
with at least k leaves. Similarly, in the second case, we can replace the path Qu in T' by the path 
Q to get an r-out-branching of D' with at least k leaves. For the third case, observe that Pu must 
contain pout since Pout = Pi or pi appears before pout on Qu and thus, pout can only be reached 
from pi. Similarly, Qu must contain pi. Thus, T' \ {u,v} is an r-out-branching of D' \ {u,v}. Let 
X be the vertex after pout on P, and let y be the vertex after pi on Q. Vertices x and y must be 
distinct vertices in R and thus there must be two vertex disjoint paths Px and Qy rooted at x and y 
respectively so that V{Px)L)V{Qy) = R. We build an r-out-branching T from {T'\{u, v})U PxUQy 
by letting x be the child of pout and y be the child of pin- In this case T' and T have the same 
number of leaves outside of V{P) and T' has at most two leaves in V{Pu) while both the leaf of 
Pu and the leaf of Qy are leaves in T. Hence T has at least k leaves. ■ 



B Proofs moved from Section [4] 

Lemma 4.5 There is a nice forest in P. 

Proof. We define a subgraph F of D[V{P)] as follows. The vertex set of F is V{P) and an arc 
PtPs is in A{F) \i ps ^ S and t is the largest number so that ptPs G ^{D). Notice that all arcs of 
F are covered by property (6) in the definition of a nice forest. 

We prove that P is a forest. Suppose for contradiction that there is a cycle C in F. By definition 
of F every vertex has in-degree at most 1, C must be a directed cycle. Since every vertex in S 
has in-degree in C H 5" = 0. Consider the highest numbered vertex pi on C. Since P has no 
forward arcs, Pi-i is the predecessor of pi in C . The construction of F implies that there can not 
be an arc pqPi where q > i m. A{D). Also, pi does not have any in-arcs from outside of P. Thus, 
Pi-i disconnects pi from the root. Hence, by Rule [2]p,jpj_i A^D). Let pj be the predecessor of 
Pi-i in C. Then j < i — 1, since PiPi-i A.[D) and pi is the highest numbered vertex in C. Hence 
j = i — 2. This contradicts that D is a reduced instance since the arc pi-2Pi-i disconnects pi-i 
and Pi from the root r implying that Rule [3] can be applied. Since F is a forest and since every 
vertex in V{P) except for vertices in S have in-degree 1 we conclude that F is a forest of directed 
trees rooted at vertices in 5. Since F is a forest and P has no forward arcs, F is a nice forest. ■ 
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Lemma 4.6 Let F he a nice forest of P. There are at most 5{k — 1) key vertices of F. 

Proof. By the proof of Corollary [32] there is an r-out-tree Ts with {V{Ts) D ViP)) C (S* U {pi}) 
and {A{Ts) n A{P)) = 0, such that all vertices in 5 \ {pi} are leaves of Ts- We build an r-out-tree 
Tp = {V{Ts) U V{P),A{Ts) U A{F)). Notice that every leaf of F is a leaf of Tp, except possibly 
for pi. Since D is a no-instance Tp has at most k — 1 leaves and k — 2 vertices with out-degree 
at least 2. Thus, F has at most k leaves and at most k — 2 vertices with out-degree at least 2. 
Hence the number of vertices in F whose parent in F has out-degree at least 2 is at most 2k — 2. 
Finally, by Corollary 14.21 |5| < k. Adding up these upper bounds yields that there are at most 
k-l + k-2 + 2k-2 + k = 5{k-l)]iey vertices of F. m 



Corollary 4.8 No arc PiPi+i is a forward arc of F. 

Proof. If PiPi+i is a forward arc of F then there is a path from pi to Pi+i in F. By Observation 
Wn\piPi+i is the unique path from pi to p^+i in D\V{P)\. Hence PiPi+i G -^[F) contradicting that 
it is a forward arc. ■ 



Observation 4.10 If neither pi nor pi^i are key vertices, then either piPi^i ^ A{F) or pi^ipi^2 ^ 
A{F). 

Proof. Assume for contradiction that PiPi+i E ^iP) and E ^i^)- Since neither pi 

nor Pi+i are key vertices, both Pi+i and Pi-^2 niust have in-degree 1 in D. Then the arc PiPi+i 
disconnects both pj+i and Pi+2 from the root r and Rule [3] can be applied, contradicting that D is 
a reduced instance. ■ 



Lemma 4.11 V{P') induces a directed path in P. 

Proof. We first prove that for any arc piPi+i G -^{P') such that PiPi+i ^ ^{F), there is a path from 
Pi+i to Pi in P. Suppose for contradiction that there is no path from Pi+i to pi in F, and let x be the 
parent of pj+i in P. Then PiPi+i is not a backward arc of P and hence F' = {P\ xpi+i) U {piPi^i} 
is a forest of out-trees rooted at vertices in 5. Also, since pi+i is not a key vertex, x has out-degree 
1 in F and thus x is a leaf in P'. Since pi is not a leaf in P, P' has one more leaf than P. Now, 
every vertex with out-degree at least 2 in F has out-degree at least 2 in P' . Additionally, pi has 
out-degree 2 in F'. Hence F' is a nice forest of P with more leaves than P, contradicting the choice 
of P. 

Now, notice that by Observation 14.71 anv path in from a vertex u £ V{P') to a vertex 

V G yiP') that contains a vertex w ^ V{P') must contain either the arc px-iPx or the arc pyPy+i. 
Since neither of those two arcs are arcs of P it follows that for any arc PiPi+i G ^{P') such that 
PiPi+i ^ A{P), there is a path from pi^i to pi in F[V{P')]. Hence F[V{P')] is weakly connected, 
that is, the underlying undirected graph is connected. Since every vertex in V{P') has in-degree 1 
and out-degree 1 in F we conclude that P[V{P')] is a directed path. ■ 



Observation 4.12 For any pair of vertices Pi,Pj G V{P') if i < j — 2 then pj appears before pi in 
Q'. 

Proof. Suppose for contradiction that pi appears before pj in Q'. By Observation l4.7l p.,p,-|-i pi+'? ■ ■ - Pj 
is the unique path from pi to pj in D[V{P')]. This path contains both the arc PiPi+i and pi^iPi-^2 
contradicting Observation 14.101 ■ 



Lemma 4.13 All arcs of D[V{P')] are contained in A{P') U A{F). 
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Proof. Since P has no forward arcs it is enough to prove that any arc pjPi G A{D[V {P')]) with 
i < j is an arc of F. Suppose this is not the case and let pq be the parent of pi in F. We know 
that Pi has in-degree at least 2 in D and also since pi is not a key vertex pq has in-degree one in 
F. Hence by definition of F being a nice forest, we have that for every t > q, ptPi ^ ^(-C). It 
follows that i < j < q. By Lemma 14.111 F[V(P')] is a directed path Q' containing both pi and pj. 
If appears after pi in Q' , Observation 14.12] implies that i = j — 1 and that pj has in-degree 1 in 
D since -F is a nice forest. Thus pi separates pj from the root and Rule [2] can be applied to pjPi 
contradicting that D is a reduced instance. Hence pj appears before pi in Q'. 

Since pj is an ancestor of pi in F and Pq is the parent of pi in F, pj is an ancestor of Pq in 
F and hence pq G V{Q') = V{P'). Now, pj comes before pq in Q' and j < q so Observation 14.121 
implies that q = j + 1 and that pq has in-degree 1 in D since F is a nice forest. Thus pj separates 
Pq from the root r and both pjpi and pqPi are arcs of D. Hence Ruled] can be applied to remove 
the arc pqPi contradicting that D is a reduced instance. ■ 

Observation 4.15 Let Q' = F[V{P')]. For any pair of vertices u,v such that there is a path 
Q'[uv] from u to v in Q' , Q'[uv] is the unique path from u to v in D[V{P')]. 

Proof. By Lemma [4.111 Q' is a directed path /1/2 . . . /|p/| and let Q'[fifi] be the path /1/2 ■ ■ ■ fi- 
We prove that for any i < \Q'\, fifi+i is the only arc from V{Q'[fifi]) to l^((5'[/j+i/|P'|])- By 
Lemma 14.131 all arcs of D[V{P')] are either arcs of P' or arcs of Q'. Since Q' is a path, fifi+i is 
the only arc from V{Q'[fifi\) to l^((5'[/i+i/|p'|]) in Q' . By CoroUarv 14.81 there are no arcs from 
y{Q'[fifi\) to V{Q'[fi+if\pi\]) in P', except possibly for fifi+i. ■ 

Corollary 4.17 There are at most 14(/c — 1) vertices in P' with arcs to vertices outside of P' . 

Proof. By Lemma 14.31 there are at most 7{k — 1) vertices that are endpoints of arcs originating in 
P' . By Lemma 14.161 each such vertex is the endpoint of at most 2 arcs from vertices in P'. ■ 

Lemma 4.19 Let D be a reduced no-instance to Rooted /c-Leaf Out-Branching. Then 
\V{D)\=0{k^). 

Proof. Let T be a BFS-tree of D. T has at most k — 1 leaves and at most k — 2 inner vertices with 
out-degree at least 2. The remaining vertices can be partitioned into at most 2 A; — 3 paths Pi . . . Pt 
with all vertices having out-degree 1 in T. We prove that for every q S {1, . . . ,t}, \Pq\ = 0{k'^). 
Let P be a nice forest of Pq with the maximum number of leaves. By Lemma 14.61 F has at most 
5 (A; — 1) key vertices. Let pi and pj be consecutive key vertices of F on Pq. By Observation 
14.101 there is a path P' = PxPx+i ■ ■ - Py containing no key vertices, with x < i + 1 and y > j — 1, 
such that neither p^-iPx nor pyPy^i are arcs of F. By Lemma 14.181 |P'| < 154(/c — 1) -|- 10 so 
\Pq\ < (5(A:-1)-M)(154(A;-1)-M0) + 3(5(A:-1)). Hence, \ V{D)\ < 2A;(5A;(154(A: - 1) 10 + 3)) < 
1540A;3 = 0(A;3). . 

C Proofs moved from Section [5] 

Observation 5.4 Let D = iy,Ai U A2) he a nice willow graph. Every out-branching of D with 
the maximum number of leaves is rooted at the top vertex pn 

Proof. Let P = piP2 ■ ■ - Pn be the stem of D and suppose for contradiction that there is an out- 
branching T with the maximum number of leaves rooted at pi, i < n. Since D is a nice willow 
D' = (^,^42) has a pn-out-branching T'. Since every arc of A2 is a back arc of P, T'[{vj : j > i}] 
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is an p.„-out-branching of D[{vj : j > i}]. Then T" = {V, {vxVy £ A{T') : y > i} U {vxVy £ A{T) : 
y < i}) is an out-branching of D. If i = n — 1 then p„ is not a leaf of T since the only arcs going 
out of the set {pn,Pn-i} start in p„. Thus, in this case, all leaves of T are leaves of T" and 
is a leaf of T" and not a leaf of T, contradicting that T has the maximum number of leaves. ■ 

Lemma 5.5 /c-Leaf Out-Tree in nice willow graphs is NP-complete under Karp reductions. 

Proof. We reduce from the well known NP-complete Set Cover problem [19] . A set cover of 
a universe U \s a. family T' of sets over U such that every element of u appears in some set in 
JT'. In the Set Cover problem one is given a family T = {Si, 52, . . . Sm} of sets over a universe 
J7, \U\ = n, together with a number h < m and asked whether there is a set cover T' d T with 
\J-'\ < 6 of [/. In our reduction we will assume that every element of IJ is contained in at least one 
set in T . We will also assume that b < m — 2. These assumptions are safe because if either of them 
does not hold, the Set Cover instance can be resolved in polynomial time. From an instance 
of Set Cover we build a digraph D = {V, A\ U A^) as follows. The vertex set 1/ of D is a root 
r, vertices Sj for each 1 < i < m representing the sets in T ^ vertices e^, 1 < i < n representing 
elements in \J and finally 2 vertices p and p' . 

The arc set A2 is as follows, there is an arc from r to each vertex Sj, 1 < i < ?n and there 
is an arc from a vertex Sj representing a set to a vertex Cj representing an element if G Si. 
Furthermore, rp and rp' are arcs in A2. Finally, we let A\ = {ej+iej : 1 < i < n} U {sj+iSj : 1 < 
i < m} U {eiSm, sip,pp' ,p'r}. This concludes the description of D. We now proceed to prove that 
there is a set cover T' d T with \T'\ <h\i and only if there is an out-branching in D with at least 
n -|- m -1- 2 — 6 leaves. 

Suppose that there is a set cover T' <Z T with < b. We build a directed tree T rooted at 
r as follows. Every vertex Si, 1 < i < m, p and p' has r as their parent. For every element ej, 
1 < i < n we chose the parent of ej to be Si such that ej G Si and Si G J-'' and for every i' < i 
either Si' ^ or ej ^ Si'. Since the only inner nodes of T except for the root r are vertices 
representing sets in the set cover, T is an out-branching of D with at least n + m -\- 2 — b leaves. 

In the other direction suppose that there is an out-branching T of D with at least n + m-\-2 — b 
leaves, and suppose that T has the most leaves out of all out-branchings of D. Since D \s a, nice 
willow with r as top vertex. Observation 15.41 implies that T is an r-out-branching of D. Now, if 
there is an arc ej+iej G A{T) then let Sj be a vertex such that Ci G Sj. Then T' = {T\eiJ^\ei)U s jCi 
is an r-out-branching of D with as many leaves as T. Hence, without loss of generality, for every i 
between 1 and n, the parent of in T is some Sj. Let J-' = {Si : Si is an inner vertex of T}. T' is 
a set cover of U with size at most n + m + 2 — [n + m-\-2 — b) = b, concluding the proof. ■ 
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